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ON THE CORRELATION BETWEEN SOMATIC CHAR- 
ACTERS AND FERTILITY. II. 

ILLUSTRATIONS FROM PHASEOLUS VULGARIS 

J. Arthur Harris 

I. Introductory Remarks 

The purpose of this paper, like that of the one which preceded it, 1 
is the presentation of quantitative data toward the solution of the 
relationship between the degree of development of the somatic organs 
of the plant individual and its fertility. 

The materials are drawn exclusively from pedigreed dwarf varieties 
of garden beans, Phaseolus vulgaris. These varieties are being de- 
scribed in biometric terms elsewhere, as shown in the bibliography; 
hence it is only necessary to designate the series by the key letters 
which open the detailed information of other papers. 

The characters here treated are number of pods per plant, number 
of ovules formed per pod and number of seeds matured per pod. 

II. Analysis of Data 

Table I shows the correlations, r v „, r vs , and the partial correlation 
r vs , when p = pods per plant, o = ovules per pod, s = seeds per pod. 
The actual number of plants and the number of pods counted are also 
entered. Because of the great bulkiness of the data it is not feasible 
to publish even in condensed form the 64 correlation tables from which 
these constants were deduced. 2 

The correlations of this table are also shown in diagram 1 . 

I turn first to the question of linearity of regression. This is es- 
sential statistically, since upon the distribution of the means of the 
arrays of ovules or seeds in a sensibly straight line depends the va- 

1 Bioraetrika 8: 52-65. 1911. 

2 The number of pods per plant was weighted with the number of pods counted 
in calculating mean and standard deviation of number of pods. Sheppard's modi- 
fication was nowhere applied, variation being by discrete units. The number of 
pods in which the number of ovules and seeds was determined was used as n in the 
calculation of the probable errors. 
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Diagram i. Magnitudes of correlations r p „ and r p , in 32 series of garden beans. 
The series are arranged in order of magnitude of r po : r p „ = solid lines and dots; 
r pa = broken lines and circles; r po = solid bar; f p , = broken bar. 



lidity of r as a measure of interdependence. It is of interest and import- 
ance biologically, since it shows whether the rate of modification of an 
associated character changes from one end to the other of the range of 
variation of the independent variable. 

The labor involved in testing graphically the agreement of theoreti- 
cal and empirical means for all of the series is prohibitive. Other 
methods of dealing with the problem are, as will be clear in a moment, 
unfavorable. I have, therefore, contented myself with the graphical 
treatment of a few illustrative cases (table II and diagrams 2 and 3). 

In the graphs the means for the full range of variation in number of 
pods per plant has not been shown. 8 This arises from the fact that 

3 The observed range of variation in number of pods per plant is indicated in 
the table. Limitations of space preclude the drawing of a scale for the means of 
ovules and seeds for each line, but it is the same for all and may be estimated fairly 
well by the reader from the end points of the lines as given in the table. 
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Pods per plant. 

Diagram 2. Regression of ovules per pod on number of pods per plant. The 
end points of the lines are given in the table. 



variation in the number of pods is very great 4 and that in consequence 
the extreme cases are represented by very few individuals. Thus in the 
HH series the number of pods per plant ranges from 1 to 67, but of the 

4 The coefficients of variation for the number of pods per plant (unweighted) for 

the series illustrated in the graphs are: 

HH = 51.02 FSS = 49.34 

HD = 48.97 GGD = 44.07 

GGH = 38.60 LL = 58.55 

USS = 38.36 

Mean = 46.99 
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Pods per plant. 
Diagram 3. Regression of seeds matured per pod on number of pods per 
plant. 

1 ,484 plants there are only 36 which produce more than 38 pods. To 
calculate means for arrays of such sizes is idle. Neither is it desirable, 
in view of the great differences in the range of variation from series to 
series, to combine classes in order to secure larger arrays. Any ob- 
jections based on the fewness of individuals in the arrays applicable to 
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Table II 

Equations for Regression of Ovules per Pod and Seeds per Pod on 

Number of Pods per Plant 



Series 


Regression Equation 


End Points of Line 


Range of Pods 


HH 


= 4.940 + .027 p 


1 = 4-97. 


38 = 5-96 


I-67 


HH 


s = 3-388 + .050 p 


1 = 3-44. 


38 = 5-28 


I-67 


HD 


= 4-375 + -070 p 


1 = 4.44, 


14 = 5-35 


I-I4 


HD 


s = 3.072 + .Oil p 


1 = 3-o8, 


14 = 3-23 


I-I4 


GGH 


= 5-300 4- .031 p 


1 = 5-33. 


22 = 5.99 


1-33 


GGH 


s = 3-743 + -046 p 


1 = 3-79. 


22 = 4.76 


1-33 


uss 


= 5-173 + -022 p 


6 = 5-3L 


27 = 5-77 


1-50 


uss 


* = 3-457 + -026 p 


6 = 3.62, 


27 = 4-17 


1-50 


FSS 


= 5.202 + .030 p 


3 = 5-23> 


30 = 6.11 


i-54 


FSS 


s = 2.536 + .047 p 


3 = 2.58, 


30 = 3-95 


1-54 


GGD 


= 4.561 + .089 p 


1 = 4-65> 


8 = 5-27 


1-8 


GGD 


S = 2.709 -j- .022 p 


1 = 2.73, 


8 = 2.88 


1-8 


LL 


= 4-950 + .013 p 


1 = 4.96, 


27 = 5-30 


1-27 


LL 


s = 2.515 + .014/) 


1 = 2.53. 


27 = 2.88 


1-27 



graphical methods of testing for linearity are equally valid, or even 
stronger, for formal tests, for example those based on a comparison of 
r and 1?. For o correlation y may have a substantial value when the 
arrays are numerous; it is in just such cases as this that very mis- 
leading results might be obtained through a careless application of 
Blakeman's test. 

In the graphs the theoretical lines are drawn according to the 
equations calculated from the whole material, but are not extended 
beyond classes of pods per plant represented by only five individual 
plants or fewer. 

The slope of all the lines is slight and the scatter of the empirical 
means decidedly large. This is precisely the condition to be expected 
in widely varying and but slightly correlated characters. For the 
benefit of those who may not be familiar with either the biological or 
the statistical phases of the study it may be pointed out that the ir- 
regularities seen in these diagrams are neither due to carelessness in the 
biological experiments nor to inadequacies of the statical methods, but 
solely to the difficulties inherent in the material. 

With regard to the critical question, that of the sensible linearity or 
non-linearity of the empirical means, the diagrams do not— and largely 
because of this great irregularity — furnish a final answer. But while 
there is some indication of significant non-linearity, for practical pur- 
poses a straight line describes the change in and 5 associated with an 
increase in p with sufficient accuracy. 
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Having justified the use of r 6 as a constant describing the degree of 
interdependence between p and o and s, I turn to the constants them- 
selves. 

The correlations are conspicuous for their irregularity. Practically 
speaking, however, they are positive throughout. This is strictly true 
for the relationship between pods per plant and ovules per pod. It 
holds for 29 of the 32 measures of the relationship between pods per 
plant and seeds per pod ; of the 3 negative correlations all are small and 
neither can be safely regarded as statistically trustworthy with regard 
to its probable error. 

Computing the physical constants of these varying values of cor- 
relation I find 

Tpo 1 pa 

Mean 1948 ± .0107 .1264 ± .0109 

Standard deviation 0899 ± .0075 .0915 ± .0077 

Coefficient of variation 46.16 72.38 

The relationship between number of pods and ovules per pod is 
apparently higher than that between the number of pods and the num- 
ber of seeds matured. The first of these may be considered a more 
strictly morphogenetic relationship, the second a more truly physiolo- 
gical one. The difference, .0684 ± .0153, is nearly four and a half 
times its probable error, and so perhaps significant. 

Further evidence may be obtained upon the problem of the relative 
values of the two correlations by considering the differences in the two 
constants for each experiment separately, as shown in table I. In 7 
cases r PS > r po , but in 25 cases r po > r ps . Taking the ratio of the 
differences to their probable errors, it appears that 3 of the cases in 
which the correlation for seeds is higher than that for ovules may pos- 
sibly be considered significant ( > 2.5) while there are 18 cases in which 
the correlation for ovules is higher that may be taken to be trustworthy 
statistically. The mean value of the ratios of the positive differences, 
f P s > r p0 , to their probable errors is 2.56, while that for the negative 
differences is 4.96. 

It is, therefore, clearly demonstrated that although the values of the 
correlations are both low and irregular, those measuring the relation- 
ship between the number of pods and the number of ovules are sensibly 
higher than those measuring the relationship between the number of 
pods and the number of seeds matured per pod. 

6 Even if regression is not strictly linear, r is the best constant to be used, since »j 
would be too largely affected by the errors of sampling in the numerous small arrays. 
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There can be little doubt that the observed statistical correlation 
between the number of pods per plant and the number of ovules laid 
down per pod describes a real morphogenetic interdependence. The 
constants further show that the relationship for the particular charac- 
ters under consideration is not strict but lax. 

The biological interpretation of the numerical values for number 
of pods and number of seeds matured per pod is not so simple. Here 
a second set of innate and environmental physiological factors — those 
involved in determining whether a given ovule shall die or develop — 
are superimposed upon the morphogenetic and physiological complex 
involved in determining the degree of development of p and o. 

The most patent effect of these superimposed physiological factors 
is the death (non-development) of a considerable proportion of the 
ovules. The number failing to develop into seeds varies widely from 
pod to pod, but on the average it is roughly, though as I have shown 
(19136) not exactly proportional to the number of ovules per pod. As 
a result there is a correlation of medium intensity between the number 
of seeds developing per pod and number of ovules formed (1912^). 

Clearly, therefore, there may be (at least) two quite independent 
sets of factors influencing the correlation between p and s. The first 
of these is involved in the interdependences rpo and r<«; as a resultant 
of these forces some relationship must exist between p and s. It is 
conceivable that this correlation which indicates no direct and inde- 
pendent physiological or morphogenetic nexus between p and 5 may 
account for the whole of the observed value of rps. 

To correct for the influence of rpo and ros on rps, I have recourse to 
the partial correlations between pods per plant and seeds per pod for 
constant numbers of ovules per pod. These have been calculated 
from the usual formula 

T-ns 'VO'OS 



° rps ~ vr_ , * 



' po 



•VI 



for each of the series. The values for rpo, rps are of course those of 
this paper. Those for ros have already been published (1912(f). 

The values, table I, are also shown graphically in comparison with 
the correlations rps in diagram 4. The reduction in the intensity of 
the relationship between number of pods and number of seeds when 
correction is made for the relationship of these two characters with 
number of ovules per pod is clearly marked: it occurs in every in- 
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the 



Series of plants. 

Diagram 4. Comparison of the correlations r ps (solid dots and firm line) and 
partial correlations „> (circles and broken line) for individual series. 



dividual case ; only 3 of the values of tp» have the negative sign while 
10 of the partial correlation coefficients, orp», fall below the zero bar: 
rps = . 1264 ± .0109, orps ,= .0507 ± .0090. 

Notwithstanding the material reduction in r P s when correction is 
made for the influence of rp», ros there seems to be a residuum of 
interdependence which cannot be thus accounted for. Evidence for 
this conclusion is seen in four facts, (a) The mean partial correlation 
has a positive and perhaps significantly positive value: oTps — .0507 
± .0090, oTps/E oFps = 5.64. (b) Of the individual values of ofps 
only 10 of the 32 are negative in sign, (c) The mean value of the nega- 
tive coefficients is only — .0347 while that of the positive constants is 
+ .0895. (d) Of the 10 negative coefficients 5 may perhaps be con- 
sidered statistically significant with regard to their probable error, 
although this number would be reduced if n were taken as plants in- 
stead of pods in the calculation of the probable error. Of the positive 
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coefficients 19 are over 2.5 times their probable error, as compared 
with the 5 possibly significant values noted for the negative. Further- 
more, the mean of the ratios of the 10 negative coefficients to their 
probable error is only 2.49 as compared with a mean ratio of 9.74 for 
the positive coefficients. 

Thus the evidence seems to justify fully the assertion that there is a 
slight physiological relationship between the number of pods which a 
plant produces and the number of seeds which it is capable of maturing 
in these pods, and that this relationship is quantitatively independent 
of the more strictly morphogenetic factors linking together number of 
pods and number of ovules per pod. 

Further than this the analysis cannot safely be pushed on the basis 
of available data. 

III. Comparison of the Foregoing Constants with other 
Correlations for Somatic Characters and fertility 

Since many additional series of data bearing on these problems are 
in hand and will eventually be published, I shall not compare in detail 
the results set forth here with others but I shall indicate merely the 
order of correlations which have been found. 

For length of flowering stalk and number of flowers in the umbel- 
like inflorescence of two liliaceous plants the correlations are roundly 6 

For Nothoscordium striatum, r — .500, 

For Allium stellatum, r = .620. 

For the relationship between the length of the flowering stalk 
(pedicel) and number of ovules and seeds per fruit the only available 
data are 7 

For Sanguinaria, ru = .323 ± .019, 

For Sanguinaria, ru = .363 ± .019. 
Here ru > rio, but ru — rio = .040 ± .027 only. Thus the difference 
is hardly significant with regard to its probable error; there is only a 
single series of material; further collections might show that here, as 
in garden beans, the correlation for degree of vegetative development 
and number of seeds is on the average lower than that for vegetative 
development and number of ovules per pod. The partial correlation 
in the case of Sanguinaria would of course have a significantly positive 
value. 

6 Ann. Rept. Mo. Bot. Gard. 20: 105-115. 1909. 

7 Biometrika 7: 316. 1910. 
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In these three cases the bulb or rootstock sends up only a single 
inflorescence. The correlation for length and ovules and seeds in 
Sanguinaria is much closer to the rv and rps in beans than are the cor- 
relations in Nothoscordium and Allium. This is perhaps to be expected, 
since in the two Liliaceae the correlation is between the length of a 
main axis and the number of secondary axes originating from it, while 
in Sanguinaria and in the beans the correlation is (roughly speaking) 
between the degree of development of the axis and the characteristics 
of the ovaries which it produces. 

That the correlations in Sanguinaria are sensibly higher than the 
average values for beans may be due in part to chance — a few of the 
values for beans reaching almost the magnitude of those found in the 
blood-root. It is perhaps more probable that the higher value in 
Sanguinaria is due to the facts (a) that the correlation is between two 
intimately associated organs — a single fruit terminating the simple 
axis the length of which furnishes the measure of the degree of somatic 
development, whereas in garden beans the axis is extensively divided — 
and (b) that Sanguinaria is a perennial and there is doubtless consid- 
erable age heterogeneity which probably tends to raise the correlation. 

If one extends the range of comparison to include the relationships 
within inflorescences there are several available constants. 

In Crinum longifolium* I have found for fruits per inflorescence 
and seeds per fruit, rfs =- .072 ± .024. For the inflorescence of 
Celastrus scandens 9 I have shown 

For flowers and seeds, r = .033 ± .013, 

For fruits and seeds, r = — .012 ± .013. 

In Staphylea the correlation between number of fruits per inflores- 
cence and number of ovules and seeds per pod has been investigated 10 
for series of individual shrubs, as well as for random collections from 
numbers of shrubs. It is difficult in these cases to be sure of even the 
sign of the correlation. The correlation for number of fruits and length 
of fruit is low. 11 Those for number of fruits and radial asymmetry and 
locular composition are also very small. 12 

For three series of intact inflorescences from individual trees of the 
legume Cercis canadensis in which the number of ovules in the ovaries 

8 Ann. Rept. Mo. Bot. Gard. 23: 89-91. 1912. 
'Ann. Rept. Mo. Bot. Gard. 20: 120-122. 1909. 

10 Beih. Bot. Centralbl., Abt. 1, 28: 2-10. 1911. 

11 Bot. Gaz. 53: 401-402. 1912. 

12 Zeitschr. Ind. Abst.- u. Vererbungslehre 5: 183-187. 191 1. 
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was determined by clearing with alcohol and counting under the mic- 
roscope, the results are 13 

Tree I, r = - .007 ± .023, 

Tree 2, r = .030 ± .021, 

Tree 3, r = .134 ± .024, 
where the probable errors are calculated on the basis of the number of 
ovaries dissected out and examined, not the number of inflorescences. 

Thus the correlations between the number of flowers formed or 
number of fruits matured per inflorescence and the fertility of the 
fruits — either as measured by the number of ovules laid down or the 
number of seeds ripened — are without exception very low indeed. 

IV. Summary and Discussion 

This paper, which is one of a series on the various aspects of fer- 
tility and fecundity in plants, is a contribution of data towards the 
solution of the problem of the relationship between the degree of so- 
matic development of the plant individual as measured by the number 
of fruits which it bears, and the fertility of these fruits as measured by 
the number of ovules formed and the number of seeds developing. 

The data comprise the records of number of ovules formed and 
number of seeds matured in 127,610 pods of garden beans, Phaseolus 
vulgaris, from 19,064 plants with known number of pods drawn from 32 
cultures made under a wide range of environmental conditions and 
embracing five different varieties. 

Taken in connection with data recorded in other papers they per- 
mit the consideration of points not touched upon here. It is perhaps 
fair to state that these are to be discussed in subsequent papers where 
extensive additional series of records already in hand can be reduced. 

The analyses of the data as far as carried out lead to the folowing 
conclusions : 

(1) The correlation between number of pods per plant and number 
of ovules per pod has always been found positive but low, ranging from 
.023 to .355, with an average value of about rpo = .195. 

(2) For pods per plant and seeds per pod the correlations are also 
for the most part positive, although constants which have the negative 
sign but are insignificant with regard to their probable errors some- 
times occur. 

These values range from — .046 to .338 with a mean of r P , = .126. 
13 Bot. Gaz. 53: 403-404. 1912. 
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By all available tests the coefficients for pods per plant and ovules per 
pod seem to be significantly higher than those for pods per plant and 
seeds per pod. 

(3) As a resultant of the relationship between pods per plant and 
ovules per pod (?>) and that demonstrated elsewhere between ovules 
and seeds per pod (/<>«) some correlation must be expected between 
number of pods per plant and number of seeds per pod, whether there 
be any direct physiological interdependence between these two charac- 
ters or not. 

(4) By determining the correlation between p and 5 for constant 
values of „. 4 , e . by calculating otv by the usual partial correlation 
formulae for three variables, I have tried to remove the influence of 
the interrelationships of 0, p and 5 upon the coefficient n>«. 

(5) All the coefficients measuring the relationship between number 
of pods and number of seeds are lowered by thus correcting for rv> and 
ro,, that is cr P s < r P s, always. Several of the partial correlations have 
the negative sign. Their mean value while very small, »n>s = + .051, 
is apparently significantly positive. 

(6) Thus on the average, there is some correlation between the 
numbers of pods per plant and the number of ovules which develop 
into seeds which is in part at least independent of — although it may 
be inseparably bound up with — the morphogenetic factors which link 
together the magnitudes of the two characters p and 0. This corre- 
lation which must have its origin in the factors underlying the fertili- 
zation of the ovule and its nutrition during the period of growth into a 
seed, I have ventured to designate as more truly physiological, al- 
though there is probably in reality no sharp line of demarcation be- 
tween physiological and morphogenetic in problems of the kind under 
consideration here. 

Carnegie Institution of Washington. 
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